Using statistical models to predict phrase boundaries for speech synthesis
نویسندگان
چکیده
This paper describes a variety of methods for inserting phrase boundaries in text. The methods work by ex amining the likelihood of a phrase break occurring in a sequence of three part-of-speech tags. The paper explains this basic technique and desribes more sophisticaed vari ations using distance probabilities.
منابع مشابه
Automatic phrase boundary labeling of speech synthesis database using context-dependent HMMs and n-gram prior distributions
This paper presents an automatic phrase boundary labeling method for speech synthesis database annotation using contextdependent hidden Markov models (CD-HMMs) and n-gram prior distributions. At training stage, CD-HMMs are built to describe the conditional distribution of acoustic features given phonetic label and phrase boundary. In addition, n-gram models are estimated to represent the prior ...
متن کاملتعیین مرز و نوع عبارات نحوی در متون فارسی
Text tokenization is the process of tokenizing text to meaningful tokens such as words, phrases, sentences, etc. Tokenization of syntactical phrases named as chunking is an important preprocessing needed in many applications such as machine translation information retrieval, text to speech, etc. In this paper chunking of Farsi texts is done using statistical and learning methods and the grammat...
متن کاملAcoustic Cues for Automatic Determination of Phrasing
This paper proposes a framework of automatic determination of phrasing using acoustic features derived from the speech signal. The feature vectors were defined in a series of analyses investigating the acoustic-phonetic realization of minor and major phrase boundaries and different boundary types. The resulting representation was used to train statistical classifiers to automatically determine ...
متن کاملModeling of sentence-medial pauses in bangla readout speech: occurrence and duration
Control of pause occurrence and duration is an important issue for text-to-speech synthesis systems. In text-readout speech, pauses occur unconditionally at sentence boundaries and with high probability at major syntactic boundaries such as clause boundaries, but more or less arbitrarily at minor syntactic boundaries. Pause duration tends to be longer at the end of a longer syntactic unit. A de...
متن کاملAccent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling
This paper proposes an automatic prosodic labeling technique for constructing speech database used for speech synthesis. In the corpus-based Japanese speech synthesis, it is essential to use annotated speech data with prosodic information such as phrase boundaries and accent types. However, manual annotation is generally time-consuming and expensive. To overcome this problem, we propose an esti...
متن کامل